Query cost estimation through remote system contention states analysis over the Internet

Authors

  • Weiru Liu
  • Zhining Liao
  • Jun Hong
Abstract

Query processing over the Internet involving autonomous data sources is a major task in data integration. It requires the estimated costs of possible query plans in order to select the best one with the minimum cost. In this context, the cost of a query is affected by three factors: network congestion, server contention state, and complexity of the query. In this paper, we study the effects of both the network congestion and server contention state on the cost of a query. We refer to these two factors together as system contention states. We present a new approach to determining the system contention states by clustering the costs of a sample query. We construct two cost formulas for each of the system contention states respectively using the multiple regression process. When a new query is submitted, its system contention state is estimated first using either the time slides method or the statistical method. The cost of the query is then calculated using the corresponding cost formulas. The estimated cost of the query is further adjusted to improve its accuracy. Our experiments show that our methods can produce quite accurate cost estimates of the submitted queries to remote data sources over the Internet.
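The pipeline described in the abstract (cluster the costs of a sample query into contention states, then fit a regression-based cost formula per state) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and not taken from the paper: the function names, a plain one-dimensional k-means as the clustering step, ordinary least squares via the normal equations as the regression step, the two numeric query features, and a single formula per state (the abstract mentions two per state but does not say what distinguishes them). The time slides and statistical methods for estimating the current state are not reproduced; the sketch simply takes the state as given.

```python
"""Sketch: contention states from sample-query costs, one cost formula per state."""

def nearest_state(cost, centroids):
    """Index of the centroid closest to an observed sample-query cost."""
    return min(range(len(centroids)), key=lambda j: abs(cost - centroids[j]))

def kmeans_1d(values, k, iters=50):
    """Cluster scalar costs into k groups (k >= 2) and return the centroids."""
    lo, hi = min(values), max(values)
    # Deterministic start: spread the centroids evenly between min and max.
    centroids = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in values:
            groups[nearest_state(v, centroids)].append(v)
        # An empty group keeps its previous centroid.
        updated = [sum(g) / len(g) if g else centroids[j]
                   for j, g in enumerate(groups)]
        if updated == centroids:
            break
        centroids = updated
    return centroids

def fit_cost_formula(features, costs):
    """Least-squares fit of cost = b0 + b1*x1 + ... + bn*xn."""
    rows = [[1.0] + list(f) for f in features]  # prepend intercept term
    n = len(rows[0])
    # Normal equations: (X^T X) b = X^T y
    A = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    c = [sum(r[i] * y for r, y in zip(rows, costs)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        c[col], c[pivot] = c[pivot], c[col]
        for r in range(col + 1, n):
            factor = A[r][col] / A[col][col]
            for k in range(col, n):
                A[r][k] -= factor * A[col][k]
            c[r] -= factor * c[col]
    # Back substitution.
    b = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(A[i][k] * b[k] for k in range(i + 1, n))
        b[i] = (c[i] - tail) / A[i][i]
    return b

def fit_per_state(observations, centroids):
    """observations: (sample_query_cost, query_features, query_cost) triples.

    Each observation is assigned to the state whose centroid is nearest to
    its sample-query cost; one formula is then fitted per non-empty state.
    """
    grouped = {}
    for sample_cost, feats, cost in observations:
        state = nearest_state(sample_cost, centroids)
        grouped.setdefault(state, ([], []))
        grouped[state][0].append(feats)
        grouped[state][1].append(cost)
    return {s: fit_cost_formula(f, y) for s, (f, y) in grouped.items()}

def estimate_cost(coeffs, feats):
    """Apply a fitted formula to the features of a new query."""
    return coeffs[0] + sum(b * x for b, x in zip(coeffs[1:], feats))
```

In use, `kmeans_1d` would be run once over a history of sample-query costs, `fit_per_state` over the logged queries, and `estimate_cost` at submission time with the formula of whichever state is currently believed to hold. A state with too few or collinear observations makes the normal equations singular, so a real implementation would need to guard against that case.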

Similar resources

Determining Remote System Contention States in Query Processing over the Internet

In the environment of data integration over the Internet, three major factors affect the cost of a query: network congestion situation, server contention states (workload), and data/query complexity. In this paper, we concentrate on system contention states. For a remote data source, we first determine the total number of contention states of the system through applying clustering techniques to...

Developing Cost Models with Qualitative Variables for Dynamic Multidatabase Environments

A major challenge for global query optimization in a multidatabase system (MDBS) is lack of local cost information at the global level due to local autonomy. A number of methods to derive local cost models have been suggested recently. However, these methods are only suitable for a static multidatabase environment. In this paper, we propose a new multi-states query sampling method to develop lo...

Run Time Optimizations of Join Queries for Distributed Databases over the Internet

A new probe-based run-time optimization technique is developed and demonstrated in the context of an Internet-based distributed database environment. More and more common are database systems which are distributed across servers communicating via the Internet, where a query at a given site might require data from remote sites. Optimizing the response time of such queries is a challenging task due ...

An Adaptive Probe-Based Technique to Optimize Join Queries in Distributed Internet Databases

An adaptive probe-based optimization technique is developed and demonstrated in the context of an Internet-based distributed database environment. More and more common are database systems which are distributed across servers communicating via the Internet, where a query at a given site might require data from remote sites. Optimizing the response time of such queries is a challenging task due t...

QoS-based Data Access and Placement for Federated Information Systems

A wide variety of applications require access to multiple heterogeneous, distributed data sources. By transparently integrating such diverse data sources, underlying differences in DBMSs, languages, and data models can be hidden, and users can use a single data model and a single high-level query language to access the unified data through a global schema. To address the needs of such federated i...

Journal:
  • Web Intelligence and Agent Systems

Volume 2

Publication date: 2004